Grammatical Phrase-Level Opinion Target Extraction on Chinese Microblog Messages
نویسندگان
چکیده
Microblog is one of the most widely used web applications. Weibo, which is a microblog service in China, produces plenty of opinionated messages every second. Sentiment analysis on Chinese Weibo impacts many aspects of business and politics. In this work, we attempt to address the opinion target extraction, which is one of the most important aspects of sentiment analysis. We propose a unified approach that concentrates on phrase-level target extraction. We assume that a target is represented as a subgraph of the sentence’s dependency tree and define the grammatical relations that point to the target word as TARRELs. We conduct the extraction by classifying grammatical relations with a cost-sensitive classifier that enhances performance of unbalanced data and figuring out the target subgraph by connecting and recovering TAR-RELs. Then we prune the noisy targets by empirically summarized rules. The evaluation results indicate that our approach is effective to the phrase-level target extraction on Chinese microblog messages.
منابع مشابه
Collective Opinion Target Extraction in Chinese Microblogs
Microblog messages pose severe challenges for current sentiment analysis techniques due to some inherent characteristics such as the length limit and informal writing style. In this paper, we study the problem of extracting opinion targets of Chinese microblog messages. Such fine-grained word-level task has not been well investigated in microblogs yet. We propose an unsupervised label propagati...
متن کاملOpinion Sentence Extraction and Sentiment Analysis for Chinese Microblogs
Sentiment analysis of Chinese microblogs is important for scientific research in public opinion supervision, personalized recommendation and social computing. By studying the evaluation task of NLP&CC’2012, we mainly implement two tasks, namely the extraction of opinion sentence and the determination of sentiment orientation for microblogs. First, we manually label the sample of microblog corpu...
متن کاملTopic Extraction from Microblog Posts Using Conversation Structures
Conventional topic models are ineffective for topic extraction from microblog messages since the lack of structure and context among the posts renders poor message-level word co-occurrence patterns. In this work, we organize microblog posts as conversation trees based on reposting and replying relations, which enrich context information to alleviate data sparseness. Our model generates words ac...
متن کامل基於意見詞修飾關係之微網誌情感分析技術 (Microblog Sentiment Analysis based on Opinion Target Modifying Relations) [In Chinese]
متن کامل
Grammatical Relations in Chinese: GB-Ground Extraction and Data-Driven Parsing
This paper is concerned with building linguistic resources and statistical parsers for deep grammatical relation (GR) analysis of Chinese texts. A set of linguistic rules is defined to explore implicit phrase structural information and thus build high-quality GR annotations that are represented as general directed dependency graphs. The reliability of this linguistically-motivated GR extraction...
متن کامل